A longitudinal survey of Internet host reliability - Reliable Distributed Systems, 1995. Proceedings., 14th Symposium on
Authors
Abstract
An accurate estimate of host reliability is important for correct analysis of many fault-tolerance and replication mechanisms. In a previous study, we estimated host system reliability by querying a large number of hosts to find how long they had been functioning, estimating the mean time-to-failure (MTTF) and availability from those measures, and in turn deriving an estimate of the mean time-to-repair (MTTR). However, this approach had a bias towards more reliable hosts that could result in overestimating MTTR and underestimating availability. To address this bias we have conducted a second experiment, using a fault-tolerant replicated monitoring tool. This tool directly measures TTF, TTR, and availability by polling many sites frequently from several locations. We find that these more accurate results generally confirm and improve our earlier estimates, particularly for TTR. We also find that failure and repair are unlikely to follow Poisson processes.
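The abstract does not state how MTTR was derived from MTTF and availability. A minimal sketch, assuming the standard steady-state relation availability = MTTF / (MTTF + MTTR), shows how the three quantities are linked. The function names and the sample figures are hypothetical and are not taken from the paper.

```python
from typing import Iterable


def mttr_from_availability(mttf_hours: float, availability: float) -> float:
    """Derive MTTR from MTTF and availability.

    Assumes the steady-state relation A = MTTF / (MTTF + MTTR),
    which rearranges to MTTR = MTTF * (1 - A) / A.
    """
    if mttf_hours <= 0:
        raise ValueError("MTTF must be positive")
    if not 0 < availability <= 1:
        raise ValueError("availability must be in (0, 1]")
    return mttf_hours * (1.0 - availability) / availability


def availability_from_polls(responses: Iterable[bool]) -> float:
    """Estimate availability as the fraction of polls that were answered.

    Each element is True if the host responded to that poll.
    This is a simplification of direct measurement by polling.
    """
    results = list(responses)
    if not results:
        raise ValueError("at least one poll result is required")
    return sum(1 for ok in results if ok) / len(results)


if __name__ == "__main__":
    # Hypothetical figures, chosen only to illustrate the arithmetic.
    polls = [True, True, True, False]
    a = availability_from_polls(polls)
    print(a, mttr_from_availability(300.0, a))
```

Under this relation, an availability estimate that is too low for a fixed MTTF yields an MTTR estimate that is too high, which matches the direction of the bias described in the abstract.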
Similar resources
The Challenge of Creating Productive Collaborating Information Assurance Communities via Internet Research and Standards
1 Invited position paper for the IEEE Symposium on Reliable Distributed Systems (SRDS), Panel on Reliability and Security of Distributed and Mobile Systems. Supported by the Aerospace Institute, The Aerospace IRAD Corporate Research Initiative, the DARPA Cyber Panel program, and other DoD programs. 2 The author has been a DARPA Principal Investigator on a number of networking and information as...
Guest Editorial: Special Issue on Reliable Distributed Systems
DESIGNERS of distributed systems are concerned with developing architectures, networking, software, algorithms, and applications. While research in this direction addresses some of the fundamental issues in distributed computing, topics related to modeling and simulation of multiple processor systems, real-time operation, reliability, fault tolerance, information assurance, performance measurem...
Health status assessment via the World Wide Web.
We explored the use of the World Wide Web to collect health status information for medical outcomes research. The RAND 36-Item Health Survey 1.0 (RAND-36), which contains the 36 multiple-choice questions of the Medical Outcomes Study SF-36 "Short Form" and differs only in its simplified scoring scheme, was made available for anonymous use on the Internet. Participation in the survey was invited...
Message from the Technical Program Committee Co-chairs
The IEEE International Symposium on Reliable Distributed Systems (SRDS) is a premier forum for researchers and practitioners interested in distributed systems design, development, and evaluation, particularly with emphasis on reliability, availability, safety, security, trust, and real time. SRDS is celebrating its 30th anniversary and we are happy to see that its research program remains as vibr...
Education Teaching Experience Work Experience Journal Publications Refereed Conference Publications Refereed Workshop Publications Professional Service Academic Software Projects
ions. In Proceedings of the 21st Symposium on Operating Systems Principles (SOSP), Stevenson, WA, October 2007. [6] M. Krohn, A. Yip, M. Brodsky, R. Morris, and M. Walfish. A World Wide Web Without Walls. In Proceedings of the 6th ACM Workshop on Hot Topics in Networks (HotNets), Atlanta, GA, November 2007. [7] J. Li, M. Krohn, D. Mazières, and D. Shasha. Secure untrusted data repository (SUNDR...